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l BACKGROUND OF THE INVENTION 

2 

3 7. Field of Invention 
4 

5 This invention relates to data storage systems. 

6 

7 2. Related Art 
8 

£3 

v 3 9 Snapshots of a file system capture the contents of the files and directories in 

*Q 

Wo a file system at a particular point in time. Such snapshots have several uses. They allow 

ru 

the users of the file system to recover earlier versions of a file following an unintended 

ii 

e 12 deletion or modification. The contents of the snapshot can be copied to another storage 

u 

CCb device or medium to provide a backup copy of the file system; a snapshot can also be 
copied to another file server and used as a replica. The WAFL (Write Anywhere File 

: 

15 Layout) file system includes a copy-on- write snapshot mechanism. Snapshot block own- 

16 ership in WAFL has been recorded by updating the block's entry in a blockmap file, 

17 which is a bitmap indicating which blocks are in-use and which are free for use. 

18 

19 One problem with the prior art of creating snapshots is that the requirement 

20 for additional file system metadata in the active file system to keep track of which blocks 

21 snapshots occupy. These methods are inefficient both in their use of storage space and in 

22 the time needed to create the snapshots. 
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2 A second problem with earlier snapshot implementations, was the time 

3 consuming steps of writing out a description of the snapshot state on creation and re- 

4 moving it on deletion. 

5 

6 A third problem with earlier copy-on-write mechanisms, was the required 

7 steps consumed a considerable amount of time and file system space. For example, some 

8 systems, such as those supplied with DCE/DFS, include a copy-on-write mechanism for 
□ 9 creating snapshots (called "clones"). The copy-on-write mechanism was used to record 
?rto which blocks each clone occupied. Such systems require a new copy of the inode file 

ru 

iff l and the indirect blocks for all files and directories are created when updating all of the 

Kfo original inodes. 

w 

SJ4 Accordingly, it would be advantageous to provide an improved technique 

: *"Tj 

^5 for more quickly and efficiently capturing the contents of the files and directories in the 

16 file system at a particular point in time. This is achieved in an embodiment of the inven- 

17 tion that is not subject to the drawbacks of the related art. 

18 

19 SUMMARY OF THE INVENTION 

20 

21 The invention provides an improved method and apparatus for creating a 

22 snapshot of a file system. 
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l 

2 In a first aspect of the invention, the file system uses the fact that each 

3 snapshot includes a representation of the complete active file system as it was at the time 

4 the snapshot was made, including the blockmap of disk blocks indicating which ones are 

5 free and which ones are in use (herein called the "active map"). Because a record of 

6 which blocks are being used by the snapshot is included in the snapshot itself, the file 

7 system can create and delete snapshots very quickly. The file system uses those recorded 

8 blockmaps (herein called "snapmaps") as a source of information to determine which 
^ 9 blocks cannot be reused because those blocks are being used by one or more snapshots. 

y 

% l In a second aspect of the invention, the file system uses that fact that it need 

in 

f ^2 only maintain a more limited blockmap of those disk blocks in use by the active file sys- 

3 

C313 tern, and a summary map of those disk blocks in use by one or more snapshots. The 

[Q 

^4 summary map can be computed from the snapmaps as the logical inclusive-OR of all the 

r~i 

rti5 snapmaps. Because the file system need not maintain multiple bits of in-use/free data for 

16 each block, it uses the active map in conjunction with the summary map to determine 

17 whether blocks are in-use or free. 

18 

19 In a third aspect of the invention, the file system makes use of the fact that 

20 the summary map need not be updated every time a block is allocated or freed. Accord- 

21 ingly, the file system updates the summary map only (1) when a snapshot is deleted, and 

22 then only in a background operation, (2) on demand for areas for which write allocation 
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1 is about to be performed, and (3) periodically in a background operation for selected por- 

2 tions of the summary map. These background operations are preferably performed con- 

3 currently with other file system operations. 

4 

5 Information is stored in a persistent storage medium accessible by the file 

6 system, to provide for resumption of operation following a reboot operation. For exam- 

7 pie, in a preferred embodiment, relevant information is stored in the file system "fsinfo 

8 block" for each snapshot, to indicate whether the summary file needs to be updated using 
n 9 that snapshot's snapmap information as a consequence of its creation or deletion. When a 

iQO block is freed in the active file system, the corresponding block of the summary file is 

I 

f 4 1 updated with the snapmap from the most recently created snapshot, if this has not already 

in 

been done. An in-core bit map records the completed updates to avoid repeating them 

Q3 unnecessarily. This ensures that the combination of the active bitmap and the summary 

J3-B. 

iy 

~f\4 file will consistently identify all blocks that are currently in use. Additionally, the sum- 

Hs mary file is updated to reflect the effect of any recent snapshot deletions when freeing a 

16 block in the active file system. This allows reuse of blocks that are now entirely free. 

17 After updating the summary file following a snapshot creation or deletion, the corre- 

18 sponding bit in the fsinfo block is adjusted. 

19 

20 In a fourth aspect of the invention, the algorithm for deleting a snapshot in- 

21 volves examining the snapmaps of the deleted snapshot and the snapmaps of the next 

22 oldest and next youngest snapshot. A block that was used by the deleted snapshot but is 
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1 not used by its neighbors can be marked free in the summary file, as no remaining snap- 

2 shot is using it. However, these freed blocks cannot be reused immediately, as the snap- 

3 map of the deleted snapshot must be preserved until summary updating is complete. 

4 During a snapdelete free blocks are found by using the logical OR of the active bitmap, 

5 the summary file, and the snapmaps of all snapshots for which post-deletion updating is 

6 in progress. In other words, the snapmap of the deleted snapshot protects the snapshot 

7 from reuse until it is no longer needed for updating. 
8 

f=i 9 In the preferred embodiment, the invention is operative on WAFL file sys- 

Ugio tem. However, it is still possible for the invention to be applied to any computer data 

f Mi l storage system such as a database system or a store and forward system such as cache or 

s in 

: 

f $2 RAM if the data is kept for a limited period of time. 

3 

C33 

w 

K4 BRIEF DESCRIPTION OF THE DRAWINGS 

16 Figure 1 shows a block diagram of a system for an instant snapshot. 

17 

18 Figure 2 shows a block diagram of an instant snapshot. 

19 

20 Figure 3 shows a flow diagram of a method for creating a snapshot. 

21 

22 Figure 4 shows a flow diagram of a method for updating a summary map. 
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Figure 5 shows a block diagram of copy-on-write maintenance of the active 

map. 

INCORPORATED DISCLOSURES 

The inventions described herein can be used in conjunction with inventions 
described in the following applications: 

• U.S. Patent Application Serial No. 09/642,063, Express Mail Mailing No. 
EL524781089US, filed August 18, 2000, in the name of Blake LEWIS, attorney 
docket number 103.1033.01, titled "Reserving File System Blocks". 

• U.S. Patent Application Serial No. 09/642,062, Express Mail Mailing No. 
EL524780242US, filed August 18, 2000, in the name of Rajesh SUNDARAM, 
attorney docket number 103.1034.01, titled "Dynamic Data Storage". 

• U.S. Patent Application Serial No. 09/642,066, Express Mail Mailing No. 
EL524780256US, filed August 18, 2000, in the name of Ray CHEN, attorney 
docket number 103.1047.01, titled "manipulation of Zombie Files and Evil-Twin 
Files". 
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• U.S. Patent Application Serial Number 09/642,064, in the names of Scott 
SCHOENTHAL, Express Mailing Number EL524781075US, titled "Persistent 
and Reliable Delivery of Event Messages", assigned to the same assignee, attorney 
docket number 103.1048.01, and all pending cases claiming the priority thereof. 



• U.S. Patent Application Serial Number 09/642,065, in the names of Douglas P. 
DOUCETTE, Express Mailing Number EL524781092US, titled "Improved Space 
Allocation in a Write Anywhere File System", assigned to the same assignee, at- 
torney docket number 103.1045.01, and all pending cases claiming the priority 
thereof. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

In the following description, a preferred embodiment of the invention is de- 
scribed with regard to preferred process steps and data structures. However, those skilled 
in the art would recognize, after perusal of this application, that embodiments of the in- 
vention might be implemented using a variety of other techniques without undue experi- 
mentation or further invention, and that such other techniques would be within the scope 
and spirit of the invention. 



and 
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l Lexicography 
2 

3 As used herein, use of the following terms refer or relate to aspects of the 

4 invention as described below. The general meaning of these terms is intended to be il- 

5 history and in no way limiting. 



6 

7 • fsinfo (File System Information Block) — In general, the phrase "file system in- 

8 formation block" refers to one or more copies of a block known as the "fsinfo 
t~ t 9 block". These blocks are located at fixed locations on the disks. The fsinfo block 

%ss? 

=3. 

; ; s 

\\I0 includes data about the volume including the size of the volume, volume level op- 

; a 

! ~J i tions, language and more. 

mi 

Pb • WAFL (Write Anywhere File Layout) — In general, the term "WAFL" refers to a 

f n 

[14 high level structure for a file system. Pointers are used for locating data. All the 

O 

Ml 5 data is included in files. These files can be written anywhere on the disk in chunks 

16 of file blocks placed in data storage blocks. 

17 

18 • Consistency Point (CP) — In general, the term "CP" refers to a time that a file 

19 system reaches a consistent state. When this state is reached, all the files have 

20 been written to all the blocks and are safely on disk and the one or more copies of 

21 redundant fsinfo blocks get written out. If the system crashes before the fsinfo 
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1 blocks go out, all other changes are lost and the system reverts back to the last CP. 

2 The file system advances atomically from one CP to the next. 

3 

4 • Consistent State — In general, the phrase "consistent state" refers to the system 

5 configuration of files in blocks after the CP is reached. 

6 

7 • Active file system — In general, the phrase "active file system" refers to the cur- 

8 rent file system arrived at with the most recent CP. In the preferred embodiment, 
p 9 the active file system includes the active map, the summary map and points to all 
%ao snapshots and other data storage blocks through a hierarchy of inodes, indirect 
: || l data storage blocks and more. 

m 2 

Gl3 • Active map — In general, the phrase "active map" refers to a to a file including a 

ij. 

U4 bitmap associated with the in-use or free status of blocks of the active file system. 

Si 5 

16 • Snapshot — In general, the term "snapshot" refers to a copy of the file system. 

17 The snapshot diverges from the active file system over time as the active file sys- 

18 tern is modified. A snapshot can be used to return the file system to a particular 

19 CP (consistency point). 

20 

21 • Snapmap — In general, the term "snapmap" refers to a file including a bitmap as- 

22 sociated with the vacancy of blocks of a snapshot. The active map diverges from a 
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snapmap over time as the blocks used by the active file system change during con- 
sistency points. 

Summary map — In general, the term "summary map" refers to a file including an 
IOR (inclusive OR) bitmap of all the snapmaps. 

Space map — In general, the term "space map" refers to a file including an array 
of numbers which describe the number of storage blocks used in an allocation 
area. 

Blockmap — In general, the term "blockmap" refers to a map describing the status 
of the blocks in the file system. 

Snapdelete — In general, the term "snapdelete" refers to an operation that removes 
a particular snapshot from the file system. This command can allow a storage 
block to be freed for reallocation provided no other snapshot or the active file 
system uses the storage block. 

Snapcreate — In general, the term "snapcreate" refers to the operation of retaining 
a consistency point and preserving it as a snapshot. 
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1 As described herein, the scope and spirit of the invention is not limited to 

2 any of the definitions or specific examples shown therein, but is intended to include the 

3 most general concepts embodied by these and other terms. 



5 System Elements 



6 

7 



Figure 1 shows a block diagram of a system for an instant snapshot. 



,n9 



The root block 100 includes the inode of the inode file 105 plus other in- 

Wo formation regarding the active file system 110, the active map 115, previous active file 

m 

systems known as snapshots 120, 125, 130 and 135 and their respective snapmaps 140, 
J i2 145, 150 and 155. 



in 



s 



f M3 



12 



A 



The active map 115 oV the active file system 110 is a bitmap associated 

15 with the in-use or free status of blocks\or the active file system 110. The respective 

16 snapmaps 140, 145, 150 and 155 are active Wps can be associated with particular snap- 

17 shots 120, 125, 130 and 135 and an inclusive X)R summary map 160 of the snapmaps 

18 140, 145, 150 and 155. Also shown are other blockssj 15 including double indirect blocks 

19 130 and 132, indirect blocks 165, 166 and 167 and dafca blocks 170, 171, 172 and 173. 

20 Finally, Figure 1 shows the spacemap 180 including a collection of spacemap blocks of 

21 numbers 181, 182, 183, 184 and 190. 

22 
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1 The root block 100 includes a collection of pointers that are written to the 

2 file system when the system has reached a new CP (consistency point). The pointers are 

3 aimed at a set of indirect (or triple indirect, or double indirect) inode blocks (not shown) 

4 or directly to the inode file 105 consisting of a set of blocks known as inode blocks 191, 

5 192, 193, 194 and 195. 

6 

7 The number of total blocks determines the number of indirect layers of 

8 blocks in the file system. The root block 100 includes a standard quantity of data, such as 

Q 9 128 bytes. 64 of these 128 bytes describe file size and other properties; the remaining 64 

*£) 

sTio bytes are a collection of pointers to the inode blocks 191, 192, 193, 194 and 195 in the 

fU 

IH l inode file 105. Each pointer in the preferred embodiment is made of 4 bytes. Thus, there 

^2 are approximately 16 pointer entries in the root block 100 aimed at 16 corresponding 

?3 

JSp inode blocks of the inode file 105 each including 4K bytes. If there are more than 16 

NM inode blocks, indirect inode blocks are used. 

Q 

16^ M In a preferred embodime^^le blocks are 4096 bytes and inodes are 128 

17 bytes( It follows that each block of the inode rite contains 32 (i.e. 4,096/128) separate 

18 inodes that point to other blocks 1 15 in the active fil^ystem. 

19 

20 Inode block 193 in the inode file 105 points to a set of blocks (1, 2, 3, P) 

21 called the active map 115. Each block in the active map 1 15 is a bitmap where each bit 

22 corresponds to a block in the entire volume. A "1" in a particular position in the bitmap 
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1 correlates with a particular allocated block in the active file system 110. Conversely, a 

2 "0" correlates to the particular block being unused by the active file system 110. Since 

3 each block in the active map 115 can describe up to 32K blocks or 128 MB, 8 blocks are 

4 required per GB, 8K blocks per TB. 



^7 



Another inode block nj the inode file 105 is inode block N 212. This block 



7 jjhcludes a set of pointers to a collection of snapshots 120, 125, 130 and 135 of the vol- 

8 ume. Each snapshot includes all the infoiWtion of a root block and is equivalent to an 
p9 older root block from a previous active file system. The snapshot 120 may be created at 

=3, \ 

s£ko any past CP. Regardless when the snapshot is cre^ed, the snapshot is an exact copy of 
the active file system at that time. The newest snapshot 120 includes a collection of 

£02 pointers that are aimed directly or indirectly to the same iri^de file 105 as the root block 

Q3 100 of the active file system 1 10. 

ha 5 As the active file system 1 10 changes (generally fromWriting files, deleting 

16 files, changing attributes of files, renaming file, modifying their conrents and related ac- 

17 tivities), the active file system and snapshot will diverge over time. Given the slow rate 

18 of divergence of an active file system from a snapshot, any two snapshots will share 

19 many of the same blocks. The newest snapshot 120 is associated with Vnapmap 140. 

20 Snapmap 140 is a bit map that is initially identical to the active map 115A The older 

21 snapshots 125, 130 and 194 have a corresponding collection of snapmaps 145\ 150 and 

22 155. Like the active map 1 15, these snapmaps 145, 150 and 155 include a set or\blocks 
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including bitmaps that correspond^ allocated and free blocks for the particular CP when 
the particular snapmaps 145, 150 and r§5 were created. Any active file system may have 

3 a structure that includes pointers to one or iVe snapshots. Snapshots are identical to the 

4 active file system when they are created. It folhs*ws that snapshots contain pointers to 

5 older snapshots. There can be a large nimiber of pre^ous snapshots in any active file 

6 system or snapshot. In the event that there are no snapshotSJiere will be no pointers in 

7 the active file system. 

8 

£ «j 9 Blocks not used in the active file system 110 are not necessarily available 

#0 for allocation or reallocation because the blocks may be used by snapshots. Blocks used 



* a S 



:9i by snapshots are freed by removing a snapshot using the snapdelete command. When a 

10 



12 snapshot is deleted any block used only by that snapshot and not by other snapshots nor 

3 

C3i3 by the active file system becomes free for reuse by WAFL. If no other snapshot or active 

Hi 4 files uses the block, then the block can be freed, and then written over during the next 

O 

Ml 5 copy-on- write operation by WAFL. 

16 

17 The system can relatively efficiently determine whether a block can be re- 

18 moved using the "nearest neighbor rule". If the previous and next snapshot do not allo- 

19 cate a particular block in their respective snapmaps, then the block can be freed for reuse 

20 by WAFL. For WAFL to find free space to write new data or metadata, it could search 

21 the active map 115 and the snapmaps (140, 145, 150 and 155) of the snapshots (120, 125, 
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1 130 and 135) to find blocks that are totally unused. This would be very inefficient; thus it 

2 is preferable to use the active map and the summary map as described below 

3 

A summary map 1 60 is created by using an IOR (inclusive OR) operation 

5 139 on the snapmaps M0, 145, 150 and 155. Like the active map 1 15 and the snapmaps 

6 140, 145, 150 and 155, the summary map 160 is a file whose data blocks (1, 2, 3, ...Q) 

7 contained a bit map. Each Vt in each block of the summary map. describes the alloca- 

8 tion status of one block in theXsystem with "1" being allocated and "0" being free. The 
3 9 summary map 1 60 describes the allocated and free blocks of the entire volume from all 

S3. \ 

y \ 

So the snapshots 120, 125, 130 and 135\combined. The use of the summary file 160 is to 

3 1 avoid overwriting blocks in use by snapshots. 
812 \ 

A3 An IOR operation on sets of blocks (such as 1,024 blocks) of the active 

44 map 115 and the summary map 160 produces a spacemap 180. Unlike the active map 

as 115 and the summary map 160, which are a set of blobks containing bitmaps, the space- 

16 map 180 is a set of blocks including 181, 182, 183, 184, ahd 190 containing arrays of bi- 

17 nary numbers. The binary numbers in the array represent thk addition of all the vacant 

18 blocks in a region containing a fixed number of blocks, such as 1^024 blocks. The array 

19 of binary numbers in the single spacemap block 181 represents tfle allocation of all 

20 blocks for all snapshots and the active file system in one range of 1,024 blocks. Each of 

21 the binary numbers 181, 182, 183, 184, and 190 in the array are a fixed length. In a pre- 
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1 / ferred embodiment, the binary numbers are 16 bit numbers, although only 10 bits are 

2 used. 



4 In a preferred embodiment, the large spacemap array binary number 1 82 

5 (0000001 1 1 1 1 1 1 1 10=1,021 in decimal units) tells the file system that the corresponding 

6 range is relatively full. In such embodiments, the largest binary number 

7 00001111111111 (1,023 in decimal) represents a range containing at most one empty.. 

8 The small binary number 184 (0000000000001 1 10=13 in decimal units) instructs the file 

C3 

•jq 9 system that the related range is relatively empty. The spacemap 1 80 is thus a representa- 

\Q 

Wo tion in a very compact form of the allocation of all the blocks in the volume broken into 

J M 

T Ji 1,024 block sections. Each 16 bit number in the array of the spacemap 180 corresponds 

s 12 to the allocations of blocks in the range containing 1,024 blocks or about 4 MB. Each 
O 

Kb spacemap block 180 has about 2,000 binary numbers in the array and they describe the 

f 24 allocation status for 8 GB. Unlike the summary map 120, the spacemap block 180 needs 

ass* 

15 to be determined whenever a file needs to be written. 

16 

17 Figure 2 shows a block diagram of an instant snapshot. 

18 

19 The old root block 200 of snapshot #1 201 includes the inode of the inode 

20 file 202 plus other information regarding the previous active file system known as snap- 

21 shot #1 201, the snapmap 205, earlier active file systems known as snapshot #2 210, 

22 snapshot #3 215 and snapshot #4 220, and their respective snapmaps 225, 230 and 235. 
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^p 8 ^^ / The snapm^205 of the previous active file system, snapshot #1 201, is a 

bptmap associated with the va^ncy of blocks for snapshot #1 201. The respective snap- 



3 
4 
5 



*3 9 

S3. 

in 

* 12 

Q 



15 
16 
17 
18 
19 
20 



maps 225, 230 and 235 are earlier active maps that can be associated with particular 
snapshots 210, 215 and 220 and an inclusive OR summary map 245 of the snapmaps 225, 



6 230 and 235. Also shown are other blocks 21 1 including double indirect blocks 240 and 
332, indirect blocks 250, 251 and 252 and d^ blocks 260, 262, 263 and 264. Finally, 
Figure 2 shows the spacemap 270 of snapshot #1^01 including a collection of spacemap 
blocks of binary numbers 272, 273, 274, 275 and 276. 



The old root block 200 includes a collection m pointers that are written to 
the previous active file system when the system had reached tne previous CP. The point- 
ers are aimed at a set of indirect (or triple indirect, or double indirect) inode blocks (not 
shown) or directly to the inode file 202 consisting of a set of blocks known as inode 
blocks 281, 282, 283, 284 and 285. 

An inode block 281 in the inode file 202 points to other blbcks 328 in the 
old root block 201 starting with double indirect blocks 240 and 332 (there\ could also be 
triple indirect blocks). The double indirect blocks 240 and 332 include pointers to indi- 
rect blocks 250, 251 and 252. The indirect blocks 250, 251 and 252 include pointers that 



21 are directed to data leaf blocks 260, 262, 263 and 264 of the active file system lOl . 



22 
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^A^y Inode block 2Ss\m the inode file 202 points to a set of blocks (1, 2, 3, P) 

C called the snapmap 205. Each blofck in the snapmap 205 is a bitmap where each bit cor- 

3 responds to a block in the entire volumeNA M 1" in a particular position in the bitmap cor- 

4 relates with a particular allocated block in the^ctive file system 201. Conversely, a "0" 

5 correlates to the particular block being free for allocjHkm in the old root block 201 . Each 

6 block in the snapmap 205 can describe up to 32K blocks oK{28 MB. 

7 

8 Inode file 202 also includes inode block N 285. This block includes a set of 

w 

M39 pointers to a collection of earlier snapshots, snapshot #2 210, shapshot #3 215 and snap- 

jMo shot #4 220 of the volume. Each snapshot includes all the information of a root block 

FU 

: si 

U l and is equivalent to an older root block from a previous active file system. 



CO 
s 12 



b*1 



^ % t Snapshot #1 201 also includes an old summary map 245 and old spacemap 

J _ 
r44 blocks 270. Although these blocks of data are included in snapshot #1 201 and previous 

15 snapshots, in a preferred embodiment, this dataHs not used by the active file system of 

16 figure 2. \ 

17 

18 Method of Use 
19 

20 Figure 3 shows a flow diagram of a method for using a system as shown in 

21 figure 1. 

22 
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^Jj^l""! A metflpd 300 is performed by the file system 100. Although the method 

2 300 is described serialV, the steps of the method 300 can be performed by separate ele- 

3 merits in conjunction or m parallel, whether asynchronously, in a pipelined manner, or 

4 otherwise. There is no particular requirement that the method 300 be performed in the 

5 same order in which this description lists the steps, except where so indicated. 



At a flow point 305, th\ file system 100 is ready to perform a method 300. 



□ 9 



!3i 



U3 



U 4 

U5 



16 
17 
18 



20 
21 

22 



At a step 3 10, a user will request a snapshot of the file system 100. 



At a step 315, a timer associated \ith the file system 100 initiates the crea- 
M2 tion of a new snapshot. 



At a step 320, the file system 100 received a request to make a snapshot. 



At a step 325, the file system 100 creates a new file. 



At a step 330, the root node of the new file points to the root node of the 



19 current active file system. 



At a step 335, the file system 100 makes the file read only. 
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^|^"| At a step 340, th\file system 100 updates the new summary map by using 

2 an inclusive OR of the most recenT^apmap and the existing summary file. This step 

3 must be done before any blocks are freecPm the corresponding active map block. If mul- 

4 tiple snapshots are created such that the processing overlaps in time, the update in step 

5 340 need only be done for the most recently created snapshot. 



At a flow point 345, the snapshot create and the summary file update is 



8 completed and the snapshot creation is done. 



^flo 



fit 

%? 2 



£02 



W3 



An analogous method may be performed for snapshot delete. 



Figure 4 shows a flow diagram of a method for updating a summary map. 



[l^^^j A method 400 is performed by the file system 100. Although the method 

C3 

ids 400 is described serially, the steps of the method 400 can be performed by separate ele- 

16 ments in conjunction or in parallel, whether asynchronously, in a pipelined manner, or 

17 otherwise. There is no particular requirement that the method 400 be performed in the 

18 same order in which this description lists the stepfcL except where so indicated. 



19 



20 



At a flow point 410, the file system 10(Ais ready to update the summary 



21 map. 



22 
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At a step 41 1, update of the summary map is triggered by a "snapdelete" 



2 command from, an operator or user. As part of this step, the file system 100 receives and 

3 recognizes the "sn&pdelete" command. 



5 At a step 412, the file system 100 responds immediately to the operator or 

6 user, and is ready to receivevanother operator or user command. However, while the op- 

7 erator or user sees a substantial^ immediate response, the file system 100 continues with 

8 the method 400 to process the "snapdelete" command. 



£3 9 

■ n 

fu 
--4 

502 



At a step 413, the file system 100 marks an entry in the fsinfo block to 
show that the selected snapshot (designate^ by the "snapdelete" command) has been de- 
leted. 

At a step 414, the file system 100 examines the snapmap for the selected 
snapshot for blocks that were in use by the selected snapshot, but might now be eligible 
to be freed. 

At a step 415, the file system 100 examines tM snapmaps for (A) a snap- 

19 shot just prior to the selected snapshot, and (B) a snapshot just after the selected snapshot. 

20 For blocks that were in use by the selected snapshot, the file system 100 sets the associ- 

21 ated bit to indicate the block is FREE, only if both of those snapmap\show that the block 

22 was free for those snapshots as well. 



U3 
t*14 
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2 The method 400 continues with the flow point 440. 

3 

tj\J^^l ^ a StC ^ upd^eof the summary map is triggered by a write allocation 

5 operation by the file system 100. In ^preferred embodiment, a write allocation operation 

6 occurs for a selected section of the mass storage. The "write allocation" operation refers 

7 to selection of free blocks to be seized and writW to, as part of flushing data from a set 

8 of memory buffers to mass storage. As part of this stbp, the file system 100 determines a 
□ 9 portion of the summary map corresponding to the selected section of the mass storage. 



;j4 

a 

M5 



At a step 422, the file system 100 recalculates the summary map for the 



f§2 portion of the summary map corresponding to the selected section of the mass storage. 



17 erat 



The method 400 continues with the flow point 440. 

At a step\43 1 , update of the summary map is triggered by a background op- 
on. In a preferred embodiment, the file system 100 updates about one 4K data block 



18 of the summary map. 
19 

20 At a step 432, the fil\ system 100 recalculates the summary map for the 

21 portion of the summary map selected to\e updated. 

22 
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l The method 400 continues with the flow point 440. 

2 

3 At a flow point 440, the file system 1 10 has updated at least a portion of the 

4 summary map, and is ready to be triggered for further updates later. 

5 

6 Figure 5 shows a block diagram of copy-on-write maintenance of the active 

7 map. 

8 

O9 When blocks are freed in the active map, the file system 110 is careful to 

; ; I 
. P% 

iTJo not reuse those blocks until after a consistency point has passed (and thus that the newly 

FU 

If! 1 free status of the block has been recorded in a snapshot). Accordingly, the file system 
—u 1 10 maintains two copies of the active map, a "true" copy 501 and a "safe" copy 502. 

K 

£=3. 

u 
[ri 3 

il 

Si 4 In normal operation 510 (outside a time when a consistency point is being 

H5 generated), the file system 110 maintains both the "true" copy 501 and the "safe" copy 

16 502 of the active map. Since in normal operation 510 blocks can only be freed, not allo- 

17 cated, only changes from IN-USE to FREE are allowed. The file system 1 10 makes all 

18 such changes in the "true" copy 501, but does not make them to the "safe" copy 502. The 

19 "safe" copy 502 therefore indicates those blocks which can be safely allocated at the next 

20 consistency point. 
21 
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1 While generating a consistency point, during a write allocation interval 520, 

2 blocks can be either freed (by continued operation of the file system 1 10) or allocated (by 

3 the write allocation operation). Both types of change are made to both the "true" copy 

4 501 and the "safe" copy 502. 

5 

6 While still generating a consistency point, during a flush data to disk inter- 

7 val 530, blocks can again only be freed (by continued operation of the file system 110); 

8 they cannot be allocated because the write allocation interval 520 is finished for that con- 
39 sistency point. The file system 110 makes all such changes in the "safe" copy 502, but 
fio does not make them to the "true" copy 501. At the end of the flush data to disk interval 

P3 5 

sy 

Lfli 530, the file system 110 switches the roles of the "true" copy 501 and the "safe" copy 

U. 3 

^2 502, so that all such changes were in fact made to the new "true" copy 501 only. 

£3 

J 

\14 Alternative Embodiments 

H 

16 Although preferred embodiments are disclosed herein, many variations are 

17 possible which remain within the concept, scope, and spirit of the invention, and these 

18 variations would become clear to those skilled in the art after perusal of this application. 
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